CaPPS: cache partitioning with partial sharing for multi-core embedded systems
نویسندگان
چکیده
As the number of cores in chip multi-processor systems increases, the contention over shared last-level cache (LLC) resources increases, thus making LLC optimization critical, especially for embedded systems with strict area/energy/power constraints. We propose cache partitioning with partial sharing (CaPPS), which reduces LLC contention using cache partitioning and improves utilization with sharing configuration. Sharing configuration enables the partitions to be privately allocated to a single core, partially shared with a subset of cores, or fully shared with all cores based on the co-executing applications’ requirements. CaPPS imposes low hardware overhead and affords an extensive design space to increase optimization potential. To facilitate fast design space exploration, we develop an analytical model to quickly estimate the miss rates of all CaPPS configurations using the applications’ isolated LLC access traces to predict runtime LLC contention. Experimental results demonstrate that the analytical model estimates cache miss rates with an average error of only 0.73% and with an average speedup of 3505× as compared to a cycle-accurate simulator. Due to CaPPS’s extensive design space, CaPPS can reduce the average LLC miss rate by as much as 25% as compared to baseline configurations and as much as 14–17% as compared to prior works.
منابع مشابه
Analytical Modeling of Partially Shared Caches in Embedded CMPs
In modern ubiquitous devices, optimizing shared lastlevel caches (LLCs) in embedded chip multi-processor systems (CMPs) is critical due to the increased contention for limited cache space from multiple cores. We propose cache partitioning with partial sharing (CaPPS) to reduce LLC contention and improve utilization. CaPPS can reduce the average LLC miss rate by 25% and 17% as compared to baseli...
متن کاملDynamic Classification of Program Memory Behaviors in CMPs
Multi-core processors with shared L2 caches can suffer from performance degradations when co-scheduled programs contend for cache resources in a destructive manner. In this work, we propose a new classification algorithm for determining the “personalities” of the programs with respect to their cache sharing behaviors. We first demonstrate that our scheme can more accurately predict when cache s...
متن کاملReconfigurable cache for real-time MPSoCs: Scheduling and implementation
Shared cache in modern multi-core systems has been considered as one of the major factors that degrade system predictability and performance. How to manage the shared cache for real-time multi-core systems in order to optimize the system performance while guaranteeing the system predictability is an open issue. In this paper, we present a reconfigurable cache architecture which supports dynamic...
متن کاملCoordinated Cache Management for Predictable Multi-Core Real-Time Systems
Many modern multi-core processors sport a large shared cache with the primary goal of enhancing the statistic performance of computing workloads. However, due to resulting cache interference among tasks, the uncontrolled use of such a shared cache can significantly hamper the predictability and analyzability of real-time multi-core systems. Software cache partitioning has been considered as an ...
متن کاملLevel-2 Shared Cache versus Level-2 Dedicated Cache for Homogeneous Multicore Embedded Systems
Multicore brings tremendous amount of processing speed. On the contrary, it offers challenges for embedded systems as embedded systems suffer from limited resources. Various cache memory hierarchies are proposed to satisfy the requirements of different systems. Traditionally, level-1 cache memory is dedicated to each core. However, level-2 cache can be shared (like Intel Xenon) or dedicated (li...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید
ثبت ناماگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید
ورودعنوان ژورنال:
- Design Autom. for Emb. Sys.
دوره 20 شماره
صفحات -
تاریخ انتشار 2016